[57] ABSTRACT

Systems and methods for improved speech acquisition 
are disclosed including a plurality of linearly arrayed 
sensors to detect spoken input and to output signals in 
response thereto, a beamformer connected to the sen- 
sors to cancel a preselected noise portion of the signals 
to thereby produce a processed signal, and a speech 
recognition system to recognize the processed signal 
and to respond thereto. The beamformer may also in- 
clude an adaptive filter with enable/disable circuitry for 
selectively training the adaptive filter a predetermined 
period of time. A highpass filter may also be used to 
filter a preselected noise portion of the sensed signals 
before the signals are forwarded to the beamformer. 
The speech recognition system may include a speaker
independent base which is able to be adapted by a
predetermined amount of training by a speaker, and which
system includes a voice dialer or a speech coder for
telecommunication.
SYSTEM AND METHOD FOR IMPROVED SPEECH ACQUISITION FOR HANDS-FREE VOICE TELECOMMUNICATION IN A NOISY ENVIRONMENT

BACKGROUND OF THE INVENTION

1. Field of the Invention

Without limiting its scope, this invention relates to voice processing and more particularly to a system and method for improved speech acquisition during hands-free voice telecommunication in a noisy background environment.

2. Description of the Related Art

Hands-free cellular telephony requires dialing by voice with a speech recognition system. Even so, it is known that in a noisy vehicular environment such as, for example, an automobile, a speech recognition system performs poorly with a single microphone input unless the microphone is close to the driver's mouth, as is the case of a head-mounted or hand-held (handset) microphone. However, this arrangement is not desirable from a safety, legal or human-factors viewpoint. For example, a driver's reaction time to road events takes significantly longer while using a hand-held cellular phone. Also, safety regulations in many countries prohibit using hand-held cellular telephones in road vehicles while driving.

Accordingly, improvements which overcome any or all of these problems are presently desirable.

SUMMARY OF THE INVENTION

In view of the above problems associated with the related art, it is an object of the present invention to provide a system and method for improved speech acquisition and enhancement to thereby enable more reliable speech recognition in a noisy environment, which is particularly necessary for reliable performance of hands-free voice dialing.

It is another object of the present invention to provide a system and method for improving the speech recognition rate to enhance hands-free telecommunications in a vehicular environment.

It is a further object of the present invention to provide a system and method for improving a speech acquisition system to enhance the speech signal quality and thereby cause the output of a speech coder to sound better than that obtained using a single microphone.

These and other objects are accomplished in the systems and methods of the present invention. In a preferred embodiment, enhanced speech acquisition is achieved by mounting a plurality of linearly arrayed sensors in the interior vehicular compartment to detect spoken input and to output signals in response thereto. By taking advantage of the near-field, wideband aspects of such a signal, a beamformer connected to the sensors cancels a significant amount of the noise in the signals in producing a processed target signal. A speech application system, which includes a voice dialer or a speech coder for cellular telecommunication, or both, then receives the processed target signal and responds thereto. The speech recognition system uses a speaker independent algorithm which is able to be adapted by a predetermined amount of training by a speaker and which is able to be adapted to the environmental factors such as noise and reverberation as well as to the special aspects of the speech acquisition system of the present invention. This system adaptation process improves the speech recognition accuracy substantially.

In another embodiment of the present invention, the beamformer includes an adaptive filter with enable/disable circuitry for selectively training the adaptive filter a predetermined period of time. This effectively adapts the speech acquisition system of the present invention to the noise characteristics present when a speaker uses the [illegible]. This strategy, referred to below as the adapt-and-stop strategy, greatly improves noise cancellation without the significant speech signal cancellation and distortion risks of traditional adaptive algorithms. A further embodiment employs an additional highpass filter to [illegible] a preselected lower frequency band of the signals, thereby removing or reducing certain background noise before the signals are forwarded to the beamformer.

These and other features and advantages of the invention will be apparent to those skilled in the art from the following detailed description of a preferred embodiment, taken together with the accompanying drawings, in which:

DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic drawing of a preferred embodiment of the present invention;

FIG. 2 is a schematic drawing of an adaptive filter for a generalized sidelobe canceler shown in FIG. 1;

FIG. 3 is a representation of an input signal;

FIG. 4 is a schematic drawing depicting another embodiment of the present invention; and

FIG. 5 is a flowchart of another embodiment of the present invention.

Corresponding numerals and symbols in the different figures refer to corresponding parts unless otherwise indicated.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

While various preferred embodiments described below will use the example of a driver/speaker in an automobile environment, this should in no way limit the scope of the present invention. For example, such invention would be similarly useful in the cockpit of an airplane, the cab of a truck, the cabin of a boat or any noisy environment where hands-free voice communications is desirable. Moreover, it will be understood by those skilled in the art that the improved speech acquisition system and method of the present invention will greatly increase the accuracy of speech recognition for verbal commands to a vehicle.

A preferred embodiment of the present invention's speech acquisition system in a noisy environment is depicted in FIG. 1. This system uses a linear array of sensors 10, preferably microphones, mounted inside a car and produces enhanced speech from the multi-microphone data using beamforming algorithms.

Speech enhancement using a microphone array and beamforming is a multichannel data processing problem. However, if the delays between the speaker (not shown) and microphones 10 are known or can be estimated, they can be compensated to synchronize the target signals from all channels.

In one embodiment of the present invention the time delay for each channel 12 associated with a microphone 10 is computed and compensated, based on the geometry of microphones 10 and the desired target signal. It is assumed the position of the speaker (the driver) is known; the present invention has been effective even
when the assumed position is only a reasonable estimate
of the true value. Each channel 12 contains the desired
target signal and interferences such as noise and jammer
(e.g., another talker) signals. Let $y_k(t)$ denote the output
at the k-th channel (12) after highpass filtering (highpass
filter seen in FIG. 4). Then,

[equation illegible in source]

where $s(t)$ is the target signal, $n_k(t)$ is interference, and
$\tau_k$ is time delay between the target and the k-th
microphone.

The time delay for each channel signal is computed as

$$\tau_k = \frac{\sqrt{(x - x_k)^2 + (y - y_k)^2 + (z - z_k)^2}}{v}$$

where $(x, y, z)$ is the position of the target, $(x_k, y_k, z_k)$ is
the position of the k-th microphone (channel) and $v$ is
sound velocity. The delay-compensated signal is given
by:

$$z_k(t) = y_k(t - \tau_k)$$

Note that the time index $t$ in the above equation is an integer while the delay estimate $\tau_k$, in general, is not. Therefore, $z_k(t)$ is computed by interpolation based on the discrete samples available around time $t - \tau_k$.

A simple method used to compute the interpolated value for non-integer delays is:

[equation illegible in source: $y_k(t - \tau_k)$ expressed as a sinc-weighted sum of neighboring samples]

where [.] is the largest integer that is equal to or smaller than the argument, $p_1$ and $p_2$ are lower and upper values of time index to compute the interpolated value, and sinc(x) = sin(x)/x.

The target signal is synchronized in all the delay-compensated channel data. Output 30 of delay-and-sum beamformer (hereinafter sometimes referred to as "DS-BF") 16,18,20 is just the sum of delay-compensated multi-channel data 18. DS-BF output 30 is given by

$$z_{DS}(t) = \frac{1}{M} \sum_{k=1}^{M} z_k(t)$$

where $M$, the number of microphones 10, is used as a scale factor. Because DS-BF output 30 is generated by coherently adding the target signals and incoherently adding the interference signals, it has a higher signal-to-noise ratio (SNR) than the output from any of the individual microphones. Thus, a microphone array is preferred to a single microphone.

However, the SNR gain due to DS-BF 16,18,20 is ordinarily not sufficient for a vehicular application, as only a small number of microphones may be used for practical reasons. For this reason, the use of a generalized sidelobe canceler ("GSC") 14 is preferred. GSC 14 is an adaptive beamformer consisting of three parts, as shown in FIG. 2: conventional DS-BF 16,18,20, adaptive noise canceler ("ANC") 22,24,26, and adder 28 that produces an overall GSC output 34.

Because target signals 18 of all channels 12 are synchronized after delay compensation 16, signals containing essentially only noises are obtained at ANC 22,24,26 by subtracting one channel from another. Thus, blocking matrix 22, for example, can consist of "plus ones" and "minus ones" in appropriate places. Noise data from reduced channels 24 is processed through adaptive filter 26 to minimize the power of the output signal 34, which is equivalent to minimizing the noise content of GSC output 34. GSC 14 can therefore provide larger SNR gains than possible from DS-BF 16,18,20 alone.

The output 34 of GSC 14 can be obtained by

[equation illegible in source]

where the delay term $T/2$ in the argument of DS-BF output corresponds to the delay introduced by the ANC adaptive filter 26; $z_{ANC}(t)$ is the noise canceling component; and

[equations illegible in source]

with $T$ being the number of taps or coefficients of each of the $M'$ adaptive filters $a_{ij}(t)$. Note that $W_c$ is an example blocking matrix 22 and its dimension $M'$ can be determined by selecting

[matrix illegible in source; its surviving entries are 1 and -1]

when $M' = M - 1$. Adaptive filter 26 coefficient $a_{ij}(t)$ is computed by an iterative method as

[equation illegible in source]
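The delay compensation and delay-and-sum steps described above can be illustrated with a minimal sketch. It is not the patent's implementation: the function names, the sound speed of 343 m/s, the truncated-sinc window of 8 samples per side, and the choice to align every channel to the latest-arriving one (so that channels are only ever delayed) are all assumptions made for illustration. The sketch also uses the normalized sinc, sin(pi x)/(pi x), because the source's own interpolation formula did not survive extraction.

```python
import math

SOUND_SPEED = 343.0  # m/s; assumed value for the sound velocity v


def delays_in_samples(target, mics, fs, v=SOUND_SPEED):
    """Propagation delay tau_k = |target - mic_k| / v, expressed in samples."""
    return [math.dist(target, m) / v * fs for m in mics]


def sinc(x):
    """Normalized sinc, sin(pi x) / (pi x), with sinc(0) = 1 (assumed form)."""
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)


def sample_at(y, t, half_width=8):
    """Estimate y at a possibly non-integer time t by truncated sinc interpolation."""
    n0 = math.floor(t)
    total = 0.0
    for i in range(n0 - half_width + 1, n0 + half_width + 1):
        if 0 <= i < len(y):  # samples outside the record are treated as zero
            total += y[i] * sinc(t - i)
    return total


def delay_and_sum(channels, delays):
    """Align each channel to the latest-arriving one, then average over channels."""
    d_max = max(delays)
    n = min(len(c) for c in channels)
    out = []
    for t in range(n):
        # Channel k is delayed by (d_max - d_k) so the target lines up everywhere.
        acc = sum(sample_at(y, t - (d_max - d)) for y, d in zip(channels, delays))
        out.append(acc / len(channels))
    return out
```

With seven microphones, `delays_in_samples` would be called once with the assumed driver position and the seven microphone positions, and `delay_and_sum` would then be applied to the seven highpass-filtered channels.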



GSC 14 as described above, however, has a major 
drawback. If the target signals are not completely 
blocked out at adaptive filter input 24 (thereby causing 
target "signal leakage"), adaptive filter 26 will then 
adapt to the characteristics of the target signal; this 
results in partial cancellation of the target signal. Also, 
adaptive filter 26 may modify the spectral characteris- 
tics of output speech signal 34, which is undesirable for 
speech applications 40,44,60. 

The GSC beamformer 14 is implemented in the present invention for a near-field and wide-band signal (i.e., speech). If the minimum distance between the speaker and microphones 10 is greater than twice the total array aperture, it is reasonable to assume the far-field condition. Actual measurements in a compact-size car, however, support only a near-field condition. For handling the near-field condition, it is assumed that the acoustic signal amplitude attenuates proportionately with distance ("spherical spreading"). Difference signals 24 in the lower branch in FIG. 1 are computed after compensating for the attenuation over the respective target-to-sensor distance. This method reduces the signal leakage mentioned above. Summer 20 at the upper branch in FIG. 1 is, however, implemented without such compensation, so as not to magnify the influence of noise transduced by microphones 10 farther from the speaker.
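As a rough illustration of the near-field handling, the sketch below checks the far-field rule of thumb stated above and forms difference signals after scaling each channel for spherical spreading. The function names are hypothetical, amplitude is assumed to fall off as 1/r, and differencing adjacent channels is one possible reading of "subtracting one channel from another", not a confirmed detail of the patented design.

```python
def is_far_field(min_distance, aperture):
    """Rule of thumb from the text: far field if distance > 2 x array aperture."""
    return min_distance > 2.0 * aperture


def difference_signals(z, distances):
    """Difference adjacent delay-compensated channels after 1/r compensation.

    z         -- list of delay-compensated channels (lists of samples)
    distances -- target-to-sensor distance for each channel, same order as z
    """
    r_ref = min(distances)
    # Under spherical spreading the target amplitude at sensor k scales as 1/r_k,
    # so multiplying channel k by r_k / r_ref equalizes the target component.
    scaled = [[(r / r_ref) * x for x in ch] for ch, r in zip(z, distances)]
    return [
        [a - b for a, b in zip(scaled[k], scaled[k + 1])]
        for k in range(len(scaled) - 1)
    ]
```

For the seven-microphone array with 5 cm spacing described later in the text, the aperture is 0.30 m, so a talker would need to sit more than 0.60 m from the nearest microphone for the far-field assumption to hold under this rule.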

To overcome the signal leakage problem, an embodiment of the present invention employs an effective adapt-and-stop method in GSC 14: adaptive filter 26 adaptation ("adapt") occurs over a certain period of time, shown in FIG. 3 as portion "A", when only noise is present; adaptive filter 26 is then held fixed ("stop") after this initial adaptation period, designated as portion "B" in FIG. 3, when both speech and noise are present. This modification of GSC 14 is based on two assumptions: 1) noise characteristics are not changing over a certain limited period of time and 2) adaptive filter 26 can be trained ("adapted") over a short period of time (typically less than 5 seconds). Such modification is found to improve the SNR effectively without introducing any serious signal cancellation or spectral distortion. Also, the modified GSC of the present invention is computationally more efficient because it does not adapt the filter coefficients during speech recognition for voice dialing or during speech coding for voice communication. Overall, the performance of this modified GSC 14 is relatively stable and robust, and it produces substantially lower recognition error rates than does a single microphone.
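The adapt-and-stop idea can be sketched as an adaptive noise canceler with an enable flag. The source says only that the coefficients are computed "by an iterative method", so the normalized LMS (NLMS) update used here is a stand-in chosen for illustration, and the class name, step size `mu`, and regularizer `eps` are assumptions. The T/2 delay on the primary (DS-BF) path is omitted for brevity.

```python
class AdaptAndStopCanceller:
    """Multi-reference FIR noise canceler whose adaptation can be frozen."""

    def __init__(self, num_refs, taps, mu=0.5, eps=1e-8):
        self.w = [[0.0] * taps for _ in range(num_refs)]     # filter coefficients
        self.hist = [[0.0] * taps for _ in range(num_refs)]  # recent reference samples
        self.mu = mu
        self.eps = eps
        self.enabled = True  # True = "adapt", False = "stop"

    def step(self, primary, refs):
        """Process one sample: primary is the DS-BF output, refs the noise references."""
        for h, x in zip(self.hist, refs):
            h.insert(0, x)  # newest sample first
            h.pop()
        estimate = sum(
            w * x for wr, hr in zip(self.w, self.hist) for w, x in zip(wr, hr)
        )
        out = primary - estimate
        if self.enabled:
            # NLMS update (assumed): step scaled by the reference signal power.
            power = sum(x * x for hr in self.hist for x in hr) + self.eps
            gain = self.mu * out / power
            for wr, hr in zip(self.w, self.hist):
                for j, x in enumerate(hr):
                    wr[j] += gain * x
        return out
```

In use, `enabled` would stay `True` during the noise-only portion "A" and be set to `False` before speech begins in portion "B", so the coefficients cannot adapt to leaked target speech.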

One way of implementing the adapt-and-stop strategy of the present invention for cellular telecommunication applications is to require the driver/speaker to push a button or flip a toggle switch (located, for example, on the steering column) indicating his intention to use the phone. This action triggers the ANC filter adaptation; after a few seconds, the adaptation stops and a green light (or some other indicator) signals the driver to start voice dialing.
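The button-triggered sequence above amounts to a small state machine. The following sketch is only one way to express it; the class and method names are hypothetical, and the adaptation length is left as a parameter because the text says only "a few seconds".

```python
from enum import Enum


class Phase(Enum):
    IDLE = "idle"          # waiting for the driver to press the button
    ADAPTING = "adapting"  # noise-only period, filter is being trained
    READY = "ready"        # filter frozen, indicator lit, driver may speak


class AdaptController:
    def __init__(self, adapt_samples):
        # adapt_samples must be at least 1
        self.adapt_samples = adapt_samples
        self.remaining = 0
        self.phase = Phase.IDLE

    def press_button(self):
        """Driver signals the intention to use the phone."""
        self.remaining = self.adapt_samples
        self.phase = Phase.ADAPTING

    def tick(self):
        """Advance one sample; return True if the filter should adapt on it."""
        if self.phase is not Phase.ADAPTING:
            return False
        self.remaining -= 1
        if self.remaining == 0:
            self.phase = Phase.READY
        return True

    @property
    def indicator_on(self):
        """True when the green light (or other indicator) should be shown."""
        return self.phase is Phase.READY
```

The value returned by `tick()` could drive the enable/disable input of the adaptive filter on each sample.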

Looking now at FIG. 4, processed target output signal 34 from speech acquisition system 50 is then provided to speech application 52. As seen, speech application 52 may consist of either speech coding mechanism 44 for coding and forwarding to communication link 46, or speech recognition system 38 containing training aspects 60 and speech recognizer 40 for voice recognition and forwarding to voice dialer 42, or both. It should be understood at this point that voice dialer 42 may respond to either spoken digits or a spoken phrase such as "Call Home".

Such adapt-and-stop modification in GSC 14 can be
implemented as shown in FIG. 2. Enable/disable circuit
switch 35 is toggled whenever adaptive filter 26 is to be
trained or fixed, respectively.

In a preferred embodiment, speech recognition system 38 is adaptable to the various aspects of the present speech acquisition system. For example, a speaker-independent speech recognition algorithm developed using a single-microphone-based speech database may be adapted to the use of multiple microphones, beamformer processing, background noise in the automobile, and the speech characteristics of the particular user (driver) of the system. Thus, a desired speech recognition algorithm is trained on a specific talker (so it will be speaker dependent) as well as on the specific environment of the automobile (e.g., noise characteristics of reverberations, road noises). An example of a speech recognition algorithm which could be so adapted is disclosed in co-pending patent application Ser. No. 07/458,860 by G. Doddington, et al. entitled "Grammar-Based Checksum constraints for High Performance Speech Recognition Circuit". An algorithm used by one embodiment of the present invention is a speaker-independent continuous digit recognition algorithm using the Hidden Markov Model (HMM), trained on speech data collected over long-distance telephone lines. It should be understood that such speech recognition could also be speaker dependent.

While it may make sense in a cellular phone applica- 
tion to use a speaker-dependent speech recognition 
system for expected higher performance and secure 
access, for consumer acceptance, however, the required 
training must be short. A practical solution is to use a 
speaker-independent algorithm and train (or adapt) the 
observation probabilities of the HMM model with a 
very limited number of training utterances collected 
from the given user. This training paradigm is some- 
times referred to as "speaker adaptation." When the 
speaker-independent algorithm was trained on data 
collected over long-distance telephone lines, the obser- 
vation probabilities should be adapted to account for 
the variations in such factors as the transducer (micro- 
phone array 10 plus beamforming 14) and background 
noise, as well as to correspond to the given speaker. 
This adaptation of a speech recognition algorithm is 
referred to as "system adaptation" and results in sub- 
stantial gains in the recognition performance. 

Output 34 from speech acquisition system 50 may also 
be applied to speech coder 44. This is important because 
digital cellular systems use speech coding. In an imple- 
mentation of the present invention the US Federal Stan- 
dard 1016 4.8 kbits/s CELP (Code Excited Linear Pre- 
diction) algorithm speech coder was selected. Informal 
listening tests showed that the speech quality of the 
CELP coder output was better for GSC than for 
DS-BF and single microphone cases. Specific speech 
quality improvements due to GSC 14 include lower 
background noise level and more natural-sounding 
voice; the CELP output speech sounds rough for the 
DS-BF or single microphone cases. 

Returning now to FIG. 4, a schematic drawing of 
another embodiment of the present invention is set 
forth. Highpass filters 54 are applied to transduced data 
of each channel 12 to attenuate the data below a cutoff 
frequency of about 500 Hz. Spectrograms of speech 
data collected in local and highway driving conditions 
show the presence of strong low-frequency noise com- 
ponents below about 500 Hz. These noise components 
result from such factors as mechanical vibrations of a 
moving car and road noises. Moreover, most communi- 
cation systems (such as a telephone network) do not 
carry information below 300 Hz. Applying highpass 
filters 54 at 500 Hz does not hurt the performance of 
system adapted speech recognition 40 or speech coding 
44. Rather, since it cancels most of the noise due to an 
automobile's movement, it increases the recognition 
performance. An alternative implementation is to in- 
stead place highpass filter 54 at output 34 of GSC 14, 
but performance tends not to be as good. Furthermore, 
in implementation, highpass filter 54 may be combined 
with an anti-aliasing low pass filter. 
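To make the highpass step concrete, here is a minimal sketch of a per-channel highpass filter with a 500 Hz cutoff. The source does not state the filter order or design, so the first-order RC-style recursion below is a deliberately simple stand-in; a practical system would likely use a steeper filter. The function name and default argument are assumptions.

```python
import math


def highpass(x, fs, cutoff_hz=500.0):
    """First-order RC-style highpass: y[n] = a * (y[n-1] + x[n] - x[n-1])."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    a = rc / (rc + 1.0 / fs)
    y = []
    prev_x = 0.0
    prev_y = 0.0
    for sample in x:
        prev_y = a * (prev_y + sample - prev_x)
        prev_x = sample
        y.append(prev_y)
    return y
```

Applied to each channel before beamforming, this attenuates slowly varying components such as vibration and road rumble while passing most of the speech band; with a first-order design the roll-off below the cutoff is gentle (about 6 dB per octave).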
Empirical Information.

A hardware system inside a car was developed for speech data acquisition according to the present invention. This system included seven equally spaced omnidirectional microphones (5 cm apart) in a linear array and one head-mounted close-talking microphone, an eight-channel Sony DAT (Digital Audio Tape) recorder, and an interface between the Sony DAT recorder and a Sun SPARCstation for transferring digitized multichannel data. All collected data were processed on the SPARCstation. A DAT recorder was chosen for data acquisition because of its portability.

It should be understood that the present invention should not be limited to the hardware set forth above. For example, more or fewer microphones could be used in implementing the present invention, with the spacing therebetween established as desired. The spacing for this particular implementation was determined based on the maximum frequency of speech, although other frequency selections could be made, which would in turn affect the microphone spacing. Further, after employing the omnidirectional microphones, it was found better performance was achieved when unidirectional microphones were used instead.

Data was collected inside a compact-size car (Dodge Spirit) using the DAT-based speech acquisition system according to the teaching of the present invention. Details of the pilot experiment are: 12 kHz data sampling rate per channel; one male talker; 60 utterances per condition; three idling-car conditions and two driving conditions. As speech utterances, sixty randomly generated 10-digit numbers were used to imitate actual phone numbers. The three idling conditions used are: window closed and fan off (w0f0); window closed and fan on (w0f1); and window open and fan off (w1f0). For the w1f0 condition, the car was parked right near a freeway and was exposed to a high noise level. Most people would consider this condition annoying, although such could be the situation if one were in an accident or required some sort of assistance while on the highway. The two driving conditions used were highway driving at 55 MPH and local driving. The local driving condition included stops at signal lights, acceleration, braking and turning. The digitized data was transferred to the SPARCstation, where it was downsampled to 8 kHz.

The data collected was processed using the two beamforming algorithms, DS-BF and GSC, for each of the five conditions. SNRs were computed for the center microphone in the array, for DS-BF output, and for GSC output. Both single microphone and DS-BF cases produce equivalent SNRs ranging from -2 dB to 12 dB; GSC provides SNR gains ranging from 2 dB to 9 dB over SNR gains generated by a single microphone.

The performance of the speaker-independent case (SI) was compared with the performance of the system-adapted case (SA). Recall that system adaptation requires the driver to provide a small amount of training speech in the car. Ideally, the system adaptation yields the best performance gain when the training speech is collected in the same condition as that used for testing. However, this tends not to be feasible in practice. For a number of reasons including safety and user acceptance, this training is best performed in an idling condition (i.e., in a parked car).

During initial tests, 20 utterances collected in the w0f0 condition were used for system adaptation. In recognition testing for the system-adapted case, we used the remaining 40 utterances for w0f0 and all 60 utterances for each of the remaining conditions. For example, the recognition error rates for SI and SA are given in Table 1 for single microphone, DS-BF and GSC, under each of the three idling-car conditions. Since system adaptation demonstrated superior recognition performance, it was included as a standard feature in all subsequent tests.

TABLE 1
Performance comparison between speaker independent and system adapted speech recognition (Error rates in percent)
[table data not present in source]

Speech recognition tests on data collected in a real car according to teachings of the present system, under several noise conditions, showed that the adaptive beamformer based GSC system yielded robust performance over all tested conditions. In contrast, both a single microphone and the non-adaptive delay-and-sum beamformer produced, in general, inferior performance and exhibited relatively large swings in performance over different conditions.

Looking next at FIG. 5, a flowchart of another embodiment for improved speech acquisition during hands-free cellular telephony according to the present invention is set forth. In the preferred embodiment, the system is adapted. First the adaptive filter is trained for a predetermined time (100) and then held fixed for the spoken input (110) as described previously. Next spoken signals containing speech and noise aspects are detected (120). As described above, it is preferred that a linear array of sensors is used for such detection. The channels of input from each sensor are then filtered with a highpass filter to remove the lower frequencies (130). A beamformer then processes the filtered signals (140). The beamformer contains an adaptive filter with enable/disable circuitry, as discussed above, for selectively adapting said filter for a predetermined period of time. A speech application system then recognizes (150) the speech aspects of such processed signal and responds thereto (160).

It should be understood that various embodiments of the invention can employ or be embodied in hardware, software or microcoded firmware. Process diagrams are also representative of flow diagrams for microcoded and software based embodiments. Furthermore, while a specific embodiment of the invention has been shown and described, various modifications and alternate embodiments will occur to those skilled in the art. Accordingly, it is intended that the invention be limited only in terms of the appended claims.

I claim:

1. A system for an improved acquisition of speech comprising:

a plurality of linearly arrayed sensors to detect spoken input and to output signals in response thereto;

a highpass filter to filter a preselected first noise portion from said signals received from said linearly arrayed sensors;

a beamformer, connected to said highpass filter, to cancel a preselected second noise portion from said filtered signals thereby producing a processed signal; and
a speech application system, connected to receive
said processed signal from said beamformer, for
receiving and responding to said processed signal,
wherein said beamformer is a generalized sidelobe
canceler, comprising a delay and sum beamformer,
an adaptive noise canceler and an adder which
produces said processed signal,

and wherein said adaptive noise canceler comprises a
blocking matrix, an adaptive filter and an adder,
and wherein said adaptive filter comprises
enable/disable circuitry for selectively adapting said
adaptive filter for a predetermined period of time.

2. A system for an improved acquisition of speech 
comprising: 

a plurality of linearly arrayed sensors to detect spoken
input and to output signals in response thereto;

a highpass filter to filter a preselected first noise por- 
tion from said signals received from said linearly 
arrayed sensors; 

a beamformer, connected to said highpass filter, to
cancel a preselected second noise portion from said 
filtered signals thereby producing a processed sig- 
nal; and 

a speech application system, connected to receive 
said processed signal from said beamformer, for
receiving and responding to said processed signal, 
wherein said speech application system is a speech 
coder with a communications link. 

3. A system for an improved acquisition of speech 
comprising:

a plurality of linearly arrayed sensors to detect spo- 
ken input and to output signals in response thereto; 

a highpass filter to filter a preselected first noise por- 
tion from said signals received from said linearly 
arrayed sensors;

a beamformer, connected to said highpass filter, to 
cancel a preselected second noise portion from said 
filtered signals thereby producing a processed sig- 
nal; and 

a speech application system, connected to receive
said processed signal from said beamformer, for 
receiving and responding to said processed signal, 
wherein said speech application system is a speech 
recognition system. 

4. The system of claim 3, wherein said speech recognition
system includes a speaker recognition algorithm
which is adaptable by a predetermined amount of train- 
ing by a speaker. 

5. The system of claim 4, wherein said speaker recognition
algorithm is speaker independent.

6. The system of claim 3, wherein said speech recog- 
nition system uses a Hidden Markov model speech rec- 
ognition algorithm. 

7. A system for improved acquisition of speech,
comprising:

a plurality of linearly arrayed sensors to detect spo- 
ken input and to output signals in response thereto; 

a beamformer, connected to said linearly arrayed 
sensors, to cancel a preselected noise portion of 
said signals thereby producing a processed signal,
said beamformer including an adaptive noise can- 
celer and an adder which produces said processed 
signal; and 

a speech application system, connected to receive 
said processed signal from said beamformer, for
receiving and responding to said processed signal, 
wherein said adaptive noise canceler comprises a 
blocking matrix, an adaptive filter and an adder, 
and wherein said adaptive filter further comprises
enable/disable circuitry for selectively adapting 
said adaptive filter for a predetermined period of 
time. 

8. A system for improved acquisition of speech,
comprising:

a plurality of linearly arrayed sensors to detect spo- 
ken input and to output signals in response thereto; 

a beamformer, connected to said linearly arrayed 
sensors, to cancel a preselected noise portion of 
said signals thereby producing a processed signal, 
said beamformer including an adaptive noise can- 
celer and an adder which produces said processed 
signal; and 

a speech application system, connected to receive 
said processed signal from said beamformer, for 
receiving and responding to said processed signal, 
wherein said speech application system includes a 
speech coder and a communications link. 

9. A system for improved acquisition of speech,
comprising:

a plurality of linearly arrayed sensors to detect spo- 
ken input and to output signals in response thereto; 

a beamformer, connected to said linearly arrayed 
sensors, to cancel a preselected noise portion of 
said signals thereby producing a processed signal, 
said beamformer including an adaptive noise can- 
celer and an adder which produces said processed 
signal; and 

a speech application system, connected to receive 
said processed signal from said beamformer, for 
receiving and responding to said processed signal, 
wherein said speech application system is a speech 
recognition system. 

10. The system of claim 9, wherein said speech recog- 
nition system includes a speaker recognition algorithm 
which is adaptable by a predetermined amount of train- 
ing by a speaker. 

11. The system of claim 10, wherein said speaker 
recognition algorithm is speaker independent. 

12. The system of claim 9, wherein said speech recog- 
nition system uses a Hidden Markov Model speech 
recognition algorithm. 

13. A system for improved acquisition of speech, 
comprising: 

a plurality of linearly arrayed sensors to detect spo- 
ken input and to output signals in response thereto; 

a beamformer, connected to said linearly arrayed 
sensors, to cancel a preselected noise portion of 
said signals thereby producing a processed signal; 
and 

a speech recognition system, connected to receive 
said processed signal from said beamformer, for 
receiving and responding to said processed signal, 
said speech recognition system including a speaker 
recognition algorithm which is adaptable by a pre- 
determined amount of training by a speaker. 

14. The system of claim 13, wherein said system
further comprises a highpass filter connected to said 
beamformer to filter a preselected noise portion of said 
signals received from said linearly arrayed sensors be- 
fore transmitting such filtered signals to said beam- 
former. 

15. The system of claim 13, wherein said beamformer 
is a generalized sidelobe canceler, comprising a delay 
and sum beamformer, an adaptive noise canceler and an 
adder which produces said processed signal. 
16. The system of claim 15, wherein said adaptive
noise canceler comprises a blocking matrix, an adaptive
filter and an adder.

17. The system of claim 16, wherein said adaptive
filter further comprises enable/disable circuitry for
selectively adapting said adaptive filter for a
predetermined period of time.

18. The system of claim 13, wherein said speech rec- 
ognition system uses a Hidden Markov Model speech 
recognition algorithm.

19. A method for improved acquisition of speech 
during hands-free cellular telephony, comprising the 
steps of:

detecting spoken signals, including a speech portion 
and a noise portion, with linearly arrayed sensors;

filtering a preselected portion of said noise portion of 
said signals by a highpass filter; 

processing said filtered signals with a beamformer
including an adaptive filter and enable/disable
circuitry for selectively adapting said adaptive filter
for a predetermined period of time;

recognizing said speech portion; and

responding to said processed signal.

20. The method of claim 19, wherein said step of 
responding to said processed signal includes the step of 
dialing a number associated with said speech portion. 

21. The method of claim 19, wherein said step of 
recognizing said speech portion further includes the 
step of employing a Hidden Markov Model speech 
recognition algorithm to aid in recognizing such speech 
portion. 

22. The method of claim 19, wherein the method
further comprises the step of adapting an adaptive filter
for a predetermined period of time and the step of
stopping said adaptation of said adaptive filter after said step
of detecting.